Dataset statistics
| Number of variables | 40 |
|---|---|
| Number of observations | 26990 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 2.7 MiB |
| Average record size in memory | 104.0 B |
Variable types
| Numeric | 7 |
|---|---|
| Categorical | 1 |
| Boolean | 32 |
is_starrable_False has constant value "True" | Constant |
location_state_England is highly correlated with country_GB and 1 other fields | High correlation |
country_GB is highly correlated with location_state_England and 1 other fields | High correlation |
country_US is highly correlated with location_state_England and 1 other fields | High correlation |
location_state_England is highly correlated with country_GB and 1 other fields | High correlation |
country_GB is highly correlated with location_state_England and 1 other fields | High correlation |
country_US is highly correlated with location_state_England and 1 other fields | High correlation |
location_state_England is highly correlated with country_GB and 1 other fields | High correlation |
country_GB is highly correlated with location_state_England and 1 other fields | High correlation |
country_US is highly correlated with location_state_England and 1 other fields | High correlation |
category_Other is highly correlated with is_starrable_False | High correlation |
country_AU is highly correlated with is_starrable_False | High correlation |
category_Performances is highly correlated with is_starrable_False | High correlation |
category_Poetry is highly correlated with is_starrable_False | High correlation |
location_state_PA is highly correlated with is_starrable_False | High correlation |
country_CA is highly correlated with is_starrable_False | High correlation |
location_state_MA is highly correlated with is_starrable_False | High correlation |
location_state_CA is highly correlated with is_starrable_False | High correlation |
country_IT is highly correlated with is_starrable_False | High correlation |
staff_pick_True is highly correlated with is_starrable_False | High correlation |
country_MX is highly correlated with is_starrable_False | High correlation |
country_ES is highly correlated with is_starrable_False | High correlation |
location_state_England is highly correlated with country_GB and 2 other fields | High correlation |
location_state_IL is highly correlated with is_starrable_False | High correlation |
country_GB is highly correlated with location_state_England and 2 other fields | High correlation |
category_Jewelry is highly correlated with is_starrable_False | High correlation |
location_state_Other is highly correlated with is_starrable_False | High correlation |
location_state_TX is highly correlated with is_starrable_False | High correlation |
country_DE is highly correlated with is_starrable_False | High correlation |
category_Graphic Novels is highly correlated with is_starrable_False | High correlation |
category_Wearables is highly correlated with is_starrable_False | High correlation |
location_state_WA is highly correlated with is_starrable_False | High correlation |
country_Other is highly correlated with is_starrable_False | High correlation |
category_Narrative Film is highly correlated with is_starrable_False | High correlation |
country_US is highly correlated with location_state_England and 2 other fields | High correlation |
category_Tabletop Games is highly correlated with is_starrable_False | High correlation |
location_state_NY is highly correlated with is_starrable_False | High correlation |
is_starrable_False is highly correlated with category_Other and 31 other fields | High correlation |
category_Dance is highly correlated with is_starrable_False | High correlation |
location_state_FL is highly correlated with is_starrable_False | High correlation |
category_Classical Music is highly correlated with is_starrable_False | High correlation |
country_FR is highly correlated with is_starrable_False | High correlation |
state is highly correlated with is_starrable_False | High correlation |
location_state_CA is highly correlated with location_state_Other | High correlation |
location_state_England is highly correlated with country_GB and 1 other fields | High correlation |
location_state_Other is highly correlated with location_state_CA | High correlation |
country_CA is highly correlated with country_US | High correlation |
country_GB is highly correlated with location_state_England and 1 other fields | High correlation |
country_US is highly correlated with location_state_England and 2 other fields | High correlation |
goal is highly skewed (γ1 = 61.00201993) | Skewed |
id has unique values | Unique |
prep_time has 2948 (10.9%) zeros | Zeros |
weekday_of_launch has 4659 (17.3%) zeros | Zeros |
hour_of_launch has 1045 (3.9%) zeros | Zeros |
Reproduction
| Analysis started | 2022-05-18 20:35:53.452982 |
|---|---|
| Analysis finished | 2022-05-18 20:36:18.171706 |
| Duration | 24.72 seconds |
| Software version | pandas-profiling v3.1.0 |
| Download configuration | config.json |
| Distinct | 1567 |
|---|---|
| Distinct (%) | 5.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 29512.47891 |
| Minimum | 0.01 |
|---|---|
| Maximum | 50000000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 421.7 KiB |
Quantile statistics
| Minimum | 0.01 |
|---|---|
| 5-th percentile | 250 |
| Q1 | 1000 |
| median | 3500 |
| Q3 | 10000 |
| 95-th percentile | 50000 |
| Maximum | 50000000 |
| Range | 49999999.99 |
| Interquartile range (IQR) | 9000 |
Descriptive statistics
| Standard deviation | 504033.4892 |
|---|---|
| Coefficient of variation (CV) | 17.07865648 |
| Kurtosis | 4807.466613 |
| Mean | 29512.47891 |
| Median Absolute Deviation (MAD) | 3000 |
| Skewness | 61.00201993 |
| Sum | 796541805.9 |
| Variance | 2.540497583 × 1011 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 5000 | 1747 | 6.5% |
| 1000 | 1325 | 4.9% |
| 10000 | 1315 | 4.9% |
| 500 | 1266 | 4.7% |
| 3000 | 1208 | 4.5% |
| 2000 | 1170 | 4.3% |
| 1500 | 831 | 3.1% |
| 2500 | 830 | 3.1% |
| 15000 | 755 | 2.8% |
| 20000 | 631 | 2.3% |
| Other values (1557) | 15912 |
| Value | Count | Frequency (%) |
| 0.01 | 1 | < 0.1% |
| 1 | 59 | |
| 2 | 1 | < 0.1% |
| 3 | 1 | < 0.1% |
| 4 | 2 | < 0.1% |
| 5 | 8 | < 0.1% |
| 7 | 1 | < 0.1% |
| 8 | 3 | < 0.1% |
| 10 | 47 | |
| 12 | 4 | < 0.1% |
| Value | Count | Frequency (%) |
| 50000000 | 1 | < 0.1% |
| 33000000 | 1 | < 0.1% |
| 25000000 | 1 | < 0.1% |
| 20000000 | 3 | |
| 10000000 | 7 | |
| 9000000 | 2 | < 0.1% |
| 7500000 | 1 | < 0.1% |
| 7300000 | 1 | < 0.1% |
| 6500001 | 1 | < 0.1% |
| 6000000 | 1 | < 0.1% |
| Distinct | 26990 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1067811742 |
| Minimum | 53154 |
|---|---|
| Maximum | 2147466649 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 421.7 KiB |
Quantile statistics
| Minimum | 53154 |
|---|---|
| 5-th percentile | 103745519.9 |
| Q1 | 523759718.2 |
| median | 1064504786 |
| Q3 | 1608616310 |
| 95-th percentile | 2038007840 |
| Maximum | 2147466649 |
| Range | 2147413495 |
| Interquartile range (IQR) | 1084856591 |
Descriptive statistics
| Standard deviation | 623465831.2 |
|---|---|
| Coefficient of variation (CV) | 0.583872425 |
| Kurtosis | -1.219457615 |
| Mean | 1067811742 |
| Median Absolute Deviation (MAD) | 541931476 |
| Skewness | 0.007666229066 |
| Sum | 2.882023891 × 1013 |
| Variance | 3.887096427 × 1017 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 522081392 | 1 | < 0.1% |
| 979492837 | 1 | < 0.1% |
| 600835406 | 1 | < 0.1% |
| 1733489739 | 1 | < 0.1% |
| 570940247 | 1 | < 0.1% |
| 670938521 | 1 | < 0.1% |
| 2084970699 | 1 | < 0.1% |
| 1815928657 | 1 | < 0.1% |
| 1635909422 | 1 | < 0.1% |
| 340941897 | 1 | < 0.1% |
| Other values (26980) | 26980 |
| Value | Count | Frequency (%) |
| 53154 | 1 | |
| 113230 | 1 | |
| 127800 | 1 | |
| 171116 | 1 | |
| 274865 | 1 | |
| 285583 | 1 | |
| 325875 | 1 | |
| 379873 | 1 | |
| 787066 | 1 | |
| 911712 | 1 |
| Value | Count | Frequency (%) |
| 2147466649 | 1 | |
| 2147460119 | 1 | |
| 2147430599 | 1 | |
| 2147416747 | 1 | |
| 2147380316 | 1 | |
| 2147364781 | 1 | |
| 2147339483 | 1 | |
| 2147336747 | 1 | |
| 2147180546 | 1 | |
| 2147034766 | 1 |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 421.7 KiB |
| successful | |
|---|---|
| failed |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 8.844164505 |
| Min length | 6 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | successful |
|---|---|
| 2nd row | successful |
| 3rd row | successful |
| 4th row | successful |
| 5th row | successful |
Common Values
| Value | Count | Frequency (%) |
| successful | 19191 | |
| failed | 7799 |
Length
Histogram of lengths of the category
Pie chart
| Value | Count | Frequency (%) |
| successful | 19191 | |
| failed | 7799 |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
campaign_length
Real number (ℝ≥0)
| Distinct | 85 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 31.45075954 |
| Minimum | 1 |
|---|---|
| Maximum | 91 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 421.7 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 14 |
| Q1 | 28 |
| median | 30 |
| Q3 | 33 |
| 95-th percentile | 60 |
| Maximum | 91 |
| Range | 90 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 12.00175685 |
|---|---|
| Coefficient of variation (CV) | 0.3816046743 |
| Kurtosis | 1.839634985 |
| Mean | 31.45075954 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.9733681399 |
| Sum | 848856 |
| Variance | 144.0421675 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 30 | 10046 | |
| 29 | 1642 | 6.1% |
| 60 | 1482 | 5.5% |
| 31 | 866 | 3.2% |
| 21 | 785 | 2.9% |
| 45 | 764 | 2.8% |
| 14 | 745 | 2.8% |
| 20 | 691 | 2.6% |
| 28 | 621 | 2.3% |
| 35 | 600 | 2.2% |
| Other values (75) | 8748 |
| Value | Count | Frequency (%) |
| 1 | 19 | 0.1% |
| 2 | 10 | < 0.1% |
| 3 | 25 | 0.1% |
| 4 | 20 | 0.1% |
| 5 | 72 | 0.3% |
| 6 | 69 | 0.3% |
| 7 | 192 | |
| 8 | 55 | 0.2% |
| 9 | 78 | 0.3% |
| 10 | 200 |
| Value | Count | Frequency (%) |
| 91 | 1 | < 0.1% |
| 90 | 23 | |
| 89 | 21 | |
| 88 | 8 | < 0.1% |
| 86 | 2 | < 0.1% |
| 84 | 1 | < 0.1% |
| 83 | 2 | < 0.1% |
| 82 | 2 | < 0.1% |
| 81 | 2 | < 0.1% |
| 80 | 3 | < 0.1% |
| Distinct | 769 |
|---|---|
| Distinct (%) | 2.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 47.58947758 |
| Minimum | 0 |
|---|---|
| Maximum | 3318 |
| Zeros | 2948 |
| Zeros (%) | 10.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 421.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 3 |
| median | 13 |
| Q3 | 37 |
| 95-th percentile | 192 |
| Maximum | 3318 |
| Range | 3318 |
| Interquartile range (IQR) | 34 |
Descriptive statistics
| Standard deviation | 136.3179994 |
|---|---|
| Coefficient of variation (CV) | 2.864456731 |
| Kurtosis | 128.901424 |
| Mean | 47.58947758 |
| Median Absolute Deviation (MAD) | 12 |
| Skewness | 9.231220255 |
| Sum | 1284440 |
| Variance | 18582.59696 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0 | 2948 | 10.9% |
| 1 | 1457 | 5.4% |
| 2 | 1255 | 4.6% |
| 3 | 1140 | 4.2% |
| 4 | 985 | 3.6% |
| 5 | 929 | 3.4% |
| 6 | 881 | 3.3% |
| 7 | 810 | 3.0% |
| 8 | 723 | 2.7% |
| 9 | 632 | 2.3% |
| Other values (759) | 15230 |
| Value | Count | Frequency (%) |
| 0 | 2948 | |
| 1 | 1457 | |
| 2 | 1255 | |
| 3 | 1140 | 4.2% |
| 4 | 985 | 3.6% |
| 5 | 929 | 3.4% |
| 6 | 881 | 3.3% |
| 7 | 810 | 3.0% |
| 8 | 723 | 2.7% |
| 9 | 632 | 2.3% |
| Value | Count | Frequency (%) |
| 3318 | 1 | |
| 3303 | 1 | |
| 3250 | 1 | |
| 3046 | 1 | |
| 2815 | 1 | |
| 2685 | 1 | |
| 2633 | 1 | |
| 2517 | 1 | |
| 2432 | 1 | |
| 2400 | 1 |
month_of_launch
Real number (ℝ≥0)
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.087106336 |
| Minimum | 1 |
|---|---|
| Maximum | 12 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 421.7 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 6 |
| Q3 | 9 |
| 95-th percentile | 12 |
| Maximum | 12 |
| Range | 11 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 3.395907488 |
|---|---|
| Coefficient of variation (CV) | 0.5578853565 |
| Kurtosis | -1.235508877 |
| Mean | 6.087106336 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 0.1725523548 |
| Sum | 164291 |
| Variance | 11.53218767 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=12)
| Value | Count | Frequency (%) |
| 3 | 2901 | |
| 2 | 2852 | |
| 4 | 2726 | |
| 5 | 2552 | |
| 10 | 2339 | |
| 11 | 2182 | |
| 1 | 2121 | |
| 9 | 2098 | |
| 7 | 2029 | |
| 6 | 1897 | |
| Other values (2) | 3293 |
| Value | Count | Frequency (%) |
| 1 | 2121 | |
| 2 | 2852 | |
| 3 | 2901 | |
| 4 | 2726 | |
| 5 | 2552 | |
| 6 | 1897 | |
| 7 | 2029 | |
| 8 | 1819 | |
| 9 | 2098 | |
| 10 | 2339 |
| Value | Count | Frequency (%) |
| 12 | 1474 | |
| 11 | 2182 | |
| 10 | 2339 | |
| 9 | 2098 | |
| 8 | 1819 | |
| 7 | 2029 | |
| 6 | 1897 | |
| 5 | 2552 | |
| 4 | 2726 | |
| 3 | 2901 |
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.386031864 |
| Minimum | 0 |
|---|---|
| Maximum | 6 |
| Zeros | 4659 |
| Zeros (%) | 17.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 421.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 2 |
| Q3 | 4 |
| 95-th percentile | 6 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 1.797575302 |
|---|---|
| Coefficient of variation (CV) | 0.7533743908 |
| Kurtosis | -0.90589679 |
| Mean | 2.386031864 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.3676106292 |
| Sum | 64399 |
| Variance | 3.231276965 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=7)
| Value | Count | Frequency (%) |
| 1 | 5747 | |
| 0 | 4659 | |
| 2 | 4538 | |
| 3 | 4160 | |
| 4 | 4026 | |
| 5 | 2168 | 8.0% |
| 6 | 1692 | 6.3% |
| Value | Count | Frequency (%) |
| 0 | 4659 | |
| 1 | 5747 | |
| 2 | 4538 | |
| 3 | 4160 | |
| 4 | 4026 | |
| 5 | 2168 | 8.0% |
| 6 | 1692 | 6.3% |
| Value | Count | Frequency (%) |
| 6 | 1692 | 6.3% |
| 5 | 2168 | 8.0% |
| 4 | 4026 | |
| 3 | 4160 | |
| 2 | 4538 | |
| 1 | 5747 | |
| 0 | 4659 |
| Distinct | 24 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 13.67021119 |
| Minimum | 0 |
|---|---|
| Maximum | 23 |
| Zeros | 1045 |
| Zeros (%) | 3.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 421.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 9 |
| median | 15 |
| Q3 | 19 |
| 95-th percentile | 22 |
| Maximum | 23 |
| Range | 23 |
| Interquartile range (IQR) | 10 |
Descriptive statistics
| Standard deviation | 6.68715033 |
|---|---|
| Coefficient of variation (CV) | 0.4891768121 |
| Kurtosis | -0.7088038079 |
| Mean | 13.67021119 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | -0.6383662374 |
| Sum | 368959 |
| Variance | 44.71797954 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=24)
| Value | Count | Frequency (%) |
| 16 | 2104 | 7.8% |
| 17 | 2042 | 7.6% |
| 15 | 1896 | 7.0% |
| 18 | 1803 | 6.7% |
| 19 | 1711 | 6.3% |
| 14 | 1581 | 5.9% |
| 20 | 1513 | 5.6% |
| 21 | 1423 | 5.3% |
| 22 | 1370 | 5.1% |
| 13 | 1229 | 4.6% |
| Other values (14) | 10318 |
| Value | Count | Frequency (%) |
| 0 | 1045 | |
| 1 | 941 | |
| 2 | 851 | |
| 3 | 719 | |
| 4 | 711 | |
| 5 | 591 | |
| 6 | 452 | |
| 7 | 525 | |
| 8 | 524 | |
| 9 | 512 |
| Value | Count | Frequency (%) |
| 23 | 1220 | |
| 22 | 1370 | |
| 21 | 1423 | |
| 20 | 1513 | |
| 19 | 1711 | |
| 18 | 1803 | |
| 17 | 2042 | |
| 16 | 2104 | |
| 15 | 1896 | |
| 14 | 1581 |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 237.2 KiB |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) |
| False | 23548 | |
| True | 3442 | 12.8% |
location_state_England
Boolean
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATION| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 237.2 KiB |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) |
| False | 24315 | |
| True | 2675 | 9.9% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 237.2 KiB |
| False | |
|---|---|
| True | 824 |
| Value | Count | Frequency (%) |
| False | 26166 | |
| True | 824 | 3.1% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 237.2 KiB |
| False | |
|---|---|
| True | 689 |
| Value | Count | Frequency (%) |
| False | 26301 | |
| True | 689 | 2.6% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 237.2 KiB |
| False | |
|---|---|
| True | 587 |
| Value | Count | Frequency (%) |
| False | 26403 | |
| True | 587 | 2.2% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 237.2 KiB |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) |
| False | 24262 | |
| True | 2728 | 10.1% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 237.2 KiB |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) |
| False | 13668 | |
| True | 13322 |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 237.2 KiB |
| False | |
|---|---|
| True | 589 |
| Value | Count | Frequency (%) |
| False | 26401 | |
| True | 589 | 2.2% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 237.2 KiB |
| False | |
|---|---|
| True | 987 |
| Value | Count | Frequency (%) |
| False | 26003 | |
| True | 987 | 3.7% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 237.2 KiB |
| False | |
|---|---|
| True | 630 |
| Value | Count | Frequency (%) |
| False | 26360 | |
| True | 630 | 2.3% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 237.2 KiB |
| False | |
|---|---|
| True | 545 |
| Value | Count | Frequency (%) |
| False | 26445 | |
| True | 545 | 2.0% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 237.2 KiB |
| False | |
|---|---|
| True | 1238 |
| Value | Count | Frequency (%) |
| False | 25752 | |
| True | 1238 | 4.6% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 237.2 KiB |
| False | |
|---|---|
| True | 477 |
| Value | Count | Frequency (%) |
| False | 26513 | |
| True | 477 | 1.8% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 237.2 KiB |
| False | |
|---|---|
| True | 313 |
| Value | Count | Frequency (%) |
| False | 26677 | |
| True | 313 | 1.2% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 237.2 KiB |
| False | |
|---|---|
| True | 424 |
| Value | Count | Frequency (%) |
| False | 26566 | |
| True | 424 | 1.6% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 237.2 KiB |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) |
| False | 23896 | |
| True | 3094 | 11.5% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 237.2 KiB |
| False | |
|---|---|
| True | 368 |
| Value | Count | Frequency (%) |
| False | 26622 | |
| True | 368 | 1.4% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 237.2 KiB |
| False | |
|---|---|
| True | 288 |
| Value | Count | Frequency (%) |
| False | 26702 | |
| True | 288 | 1.1% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 237.2 KiB |
| False | |
|---|---|
| True | 1087 |
| Value | Count | Frequency (%) |
| False | 25903 | |
| True | 1087 | 4.0% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 237.2 KiB |
| True | |
|---|---|
| False |
| Value | Count | Frequency (%) |
| True | 18930 | |
| False | 8060 |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 237.2 KiB |
| False | |
|---|---|
| True | 2388 |
| Value | Count | Frequency (%) |
| False | 24602 | |
| True | 2388 | 8.8% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 237.2 KiB |
| False | |
|---|---|
| True | 1367 |
| Value | Count | Frequency (%) |
| False | 25623 | |
| True | 1367 | 5.1% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 237.2 KiB |
| False | |
|---|---|
| True | 2338 |
| Value | Count | Frequency (%) |
| False | 24652 | |
| True | 2338 | 8.7% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 237.2 KiB |
| False | |
|---|---|
| True | 1723 |
| Value | Count | Frequency (%) |
| False | 25267 | |
| True | 1723 | 6.4% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 237.2 KiB |
| False | |
|---|---|
| True | 2391 |
| Value | Count | Frequency (%) |
| False | 24599 | |
| True | 2391 | 8.9% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 237.2 KiB |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) |
| False | 17154 | |
| True | 9836 |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 237.2 KiB |
| False | |
|---|---|
| True | 1224 |
| Value | Count | Frequency (%) |
| False | 25766 | |
| True | 1224 | 4.5% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 237.2 KiB |
| False | |
|---|---|
| True | 1656 |
| Value | Count | Frequency (%) |
| False | 25334 | |
| True | 1656 | 6.1% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 237.2 KiB |
| False | |
|---|---|
| True | 1352 |
| Value | Count | Frequency (%) |
| False | 25638 | |
| True | 1352 | 5.0% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 237.2 KiB |
| False | |
|---|---|
| True | 1530 |
| Value | Count | Frequency (%) |
| False | 25460 | |
| True | 1530 | 5.7% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 237.2 KiB |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) |
| False | 22390 | |
| True | 4600 | 17.0% |
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here. A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
First rows
| goal | id | state | campaign_length | prep_time | month_of_launch | weekday_of_launch | hour_of_launch | location_state_CA | location_state_England | location_state_FL | location_state_IL | location_state_MA | location_state_NY | location_state_Other | location_state_PA | location_state_TX | location_state_WA | country_AU | country_CA | country_DE | country_ES | country_FR | country_GB | country_IT | country_MX | country_Other | country_US | category_Classical Music | category_Dance | category_Graphic Novels | category_Jewelry | category_Narrative Film | category_Other | category_Performances | category_Poetry | category_Tabletop Games | category_Wearables | staff_pick_True | is_starrable_False | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 10000.0 | 522081392 | successful | 49 | 17 | 2 | 6 | 20 | True | False | False | False | False | False | False | False | False | False | False | False | False | False | False | False | False | False | False | True | False | False | False | False | True | False | False | False | False | False | False | True |
| 1 | 3000.0 | 688419156 | successful | 30 | 61 | 1 | 3 | 18 | False | False | False | False | False | False | True | False | False | False | False | False | False | False | False | False | False | False | False | True | False | False | False | False | False | False | True | False | False | False | False | True |
| 2 | 20000.0 | 1395612011 | successful | 29 | 10 | 2 | 1 | 13 | False | False | False | False | False | True | False | False | False | False | False | False | False | False | False | False | False | False | False | True | False | False | False | False | False | True | False | False | False | False | True | True |
| 3 | 3000.0 | 1895670076 | successful | 29 | 7 | 3 | 4 | 13 | False | False | False | False | False | False | True | False | False | False | False | False | False | False | False | False | False | False | False | True | False | False | False | False | True | False | False | False | False | False | False | True |
| 4 | 3000.0 | 273779926 | successful | 30 | 57 | 10 | 3 | 15 | False | False | False | False | False | False | True | False | False | False | False | False | False | False | False | True | False | False | False | False | False | False | False | False | False | True | False | False | False | False | False | True |
| 5 | 30000.0 | 1905826891 | successful | 31 | 21 | 4 | 2 | 15 | False | False | False | False | False | True | False | False | False | False | False | False | False | False | False | False | False | False | False | True | False | False | False | False | True | False | False | False | False | False | True | True |
| 6 | 70000.0 | 1606387274 | failed | 31 | 146 | 6 | 0 | 19 | False | False | False | False | False | False | True | False | False | False | False | False | False | False | False | False | False | False | False | True | False | False | False | False | False | True | False | False | False | False | False | True |
| 7 | 5000.0 | 1461979375 | successful | 15 | 126 | 2 | 1 | 16 | False | False | False | False | False | False | True | False | False | False | False | False | False | False | False | False | False | False | False | False | False | False | False | False | False | False | False | False | True | False | False | True |
| 8 | 24953.0 | 1351145783 | failed | 26 | 32 | 1 | 1 | 23 | False | False | False | False | False | True | False | False | False | False | False | False | False | False | False | False | False | False | False | True | False | False | False | False | False | True | False | False | False | False | True | True |
| 9 | 2000.0 | 1492834939 | successful | 30 | 44 | 11 | 5 | 22 | True | False | False | False | False | False | False | False | False | False | False | False | False | False | False | False | False | False | False | True | False | True | False | False | False | False | False | False | False | False | False | True |
Last rows
| goal | id | state | campaign_length | prep_time | month_of_launch | weekday_of_launch | hour_of_launch | location_state_CA | location_state_England | location_state_FL | location_state_IL | location_state_MA | location_state_NY | location_state_Other | location_state_PA | location_state_TX | location_state_WA | country_AU | country_CA | country_DE | country_ES | country_FR | country_GB | country_IT | country_MX | country_Other | country_US | category_Classical Music | category_Dance | category_Graphic Novels | category_Jewelry | category_Narrative Film | category_Other | category_Performances | category_Poetry | category_Tabletop Games | category_Wearables | staff_pick_True | is_starrable_False | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 26980 | 10000.0 | 693897183 | successful | 50 | 20 | 11 | 2 | 4 | True | False | False | False | False | False | False | False | False | False | False | False | False | False | False | False | False | False | False | True | False | False | False | False | False | False | False | False | False | False | False | True |
| 26981 | 500.0 | 2028386415 | failed | 29 | 0 | 3 | 1 | 7 | False | False | False | False | False | False | False | False | False | True | False | False | False | False | False | False | False | False | False | True | False | False | False | False | False | True | False | False | False | False | False | True |
| 26982 | 1500.0 | 645573052 | successful | 29 | 44 | 3 | 2 | 12 | False | True | False | False | False | False | False | False | False | False | False | False | False | False | False | True | False | False | False | False | True | False | False | False | False | False | False | False | False | False | False | True |
| 26983 | 8000.0 | 274524291 | successful | 50 | 57 | 7 | 1 | 1 | False | False | True | False | False | False | False | False | False | False | False | False | False | False | False | False | False | False | False | True | False | False | False | False | False | False | False | False | False | True | False | True |
| 26984 | 22000.0 | 554027675 | successful | 33 | 14 | 10 | 1 | 16 | True | False | False | False | False | False | False | False | False | False | False | False | False | False | False | False | False | False | False | True | False | False | False | False | False | True | False | False | False | False | True | True |
| 26985 | 30000.0 | 457952399 | failed | 35 | 30 | 2 | 3 | 12 | True | False | False | False | False | False | False | False | False | False | False | False | False | False | False | False | False | False | False | True | False | False | False | False | False | False | False | False | False | True | False | True |
| 26986 | 1000.0 | 2127789307 | successful | 33 | 7 | 8 | 1 | 20 | False | True | False | False | False | False | False | False | False | False | False | False | False | False | False | True | False | False | False | False | False | False | False | False | False | False | False | True | False | False | False | True |
| 26987 | 15000.0 | 1362143084 | failed | 29 | 0 | 9 | 1 | 13 | False | True | False | False | False | False | False | False | False | False | False | False | False | False | False | True | False | False | False | False | False | False | False | False | False | False | False | True | False | False | False | True |
| 26988 | 1100.0 | 1674716636 | failed | 60 | 0 | 11 | 3 | 3 | False | False | False | False | False | False | True | False | False | False | False | False | False | False | True | False | False | False | False | False | False | False | False | False | False | True | False | False | False | False | False | True |
| 26989 | 4000.0 | 2114005133 | successful | 41 | 33 | 3 | 0 | 7 | False | False | False | False | False | False | False | False | False | True | False | False | False | False | False | False | False | False | False | True | True | False | False | False | False | False | False | False | False | False | False | True |